Abstract. Recently WolfF [25] obtained a nearly sharp bilinear restriction 
theorem for bounded subsets of the cone in general dimension. We obtain the 
endpoint of Wolff's estimate and generalize to the case when one of the subsets 
is large. As a consequence, we are able to deduce some nearly-sharp null 
form estimates. 
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terence tao 
1. Introduction 

Let n > 2 be a fixed integer. We say that a function (j) : R^^^ H is a red 
wave if it takes values in a finite dimensional complex Hilbert space H, and its 
space-time Fourier transform </> is an measure on the set 

^kj.red 1^1) . ^(^^ < ^, < 1^1 < 2^+1} 

for some integer fc, where ei is a fixed basis vector. Similarly, we say that ^ : 
R"^^ — !■ i?' is a blue wave if it takes values in a finite dimensional complex Hilbert 
space H' and ■0 is an measure on 

^k^Mue {(^^_|^|) : z{te^) < ^,2'^- < lei < 2'=+!} 

for some integer k. In both cases we call 2*^ the frequency of the waves (j>, ip. 

Red and blue waves both solve the free wave equation, but propagate along 
different sets of characteristics. Note that blue waves are the time reversal of 
red waves. Also, these waves are automatically smooth and bounded thanks to 
the frequency localization. The vector valued formulation will be convenient for 
inductive reasons, but our implicit constants shall always be independent of H and 
H'. 

We define the energy of 4>, ip by 
(1) E{^):^\\m\l E{^):^m)\\l 

where t G R is arbitrary. This definition is independent of the choice of and is 
related to the standard notion of energy by the formula 



Throughout the paper, po = Poi'"^) will denote the exponent 

n + 3 



(2) Po := 



n+1 

The main result of this paper is the following bilinear estimate. 
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Theorem 1.1. Let (p he a. red wave of frequency 1, and ip be a blue wave of fre- 
quency 2^ for some k>0. Then we have 

(3) \\<f»P\\p < 2'=(^-^+-)£(</.)V2e(V)1/' 

for all 2 > p> po, e > 0. In particular, (j), tp have frequency 1 then 

(4) < Eicj^f'^Eii^f'^ 

In the above theorem and in the sequel, the imphcit constants may depend on 
e but are independent of H, H', and (j)tp : R""*"^ H ® H' denotes the tensor 
product (j)® if} oi 4> and if). 

The estimate (4) solves a conjecture of Machedon and Klainerman. The restric- 
tion p > po is sharp; see e.g. [7], [22], [21]. For 2 < p < oo the theory is much 
simpler, and the best possible estimate is 

II^V'Ilp < 2'="(^-i)i?(</))i/2i?(^)i/2. 

This is easily proved from the p = 2 case and Sobolev embedding. 

The estimate (4) is a genuinely bilinear estimate and cannot be proven directly 
from linear estimates. Indeed, the Strichartz estimate [19] combined with the Holder 
inequality only yields the range p > (n + l)/(n — 1), while Plancherel's theorem 
and Cauchy-Schwarz only gives the range p>2 (see e.g. [11]). In the n = 2 case, 
the fact that one could go below p — 2 was first shown by Bourgain [2], and in 
[22] a concrete range of p was given, namely p> 2 — More recently, Wolff [25] 
obtained the range p> po for all dimensions n. Thus (4) is the endpoint of that in 
[25]. 

The generalization (3) of (4) is necessary in order to develop sharp null form 
estimates, as we shall see in Section 17. The estimate (3) is sharp except for the 
e. To sec this, let ?/' be a blue wave whose Fourier transform is supported in a unit 
"square" in 2^Y,i,iue, and which is comparable to 1 on a 2*^ x 1 tube oriented in a 
blue null direction, and let = (0^)^^^ be a vector- valued red wave of frequency 1 
such that (pi is comparable to 1 on B^, where the Bi arc a family of 2^ unit balls 
that cover the above tube. The e term can probably be removed when p > po. 
When p = pq the author conjectures that e can still be removed, but this seems 
to require an extremely delicate analysis and some new Kakeya estimates for null 
rays. However, it should be fairly straightforward to replace the 2^*^ factor by a 
polynomial kP in this case. 

Broadly, our strategy to prove Theorem 1.1 is as follows. We shall localize the 
estimate (3) to a cube Q of side- length 2'', and obtain a bound independent 
of R. This will be obtained by induction on R, as follows. 

If (j) and tp are dispersed fairly evenly throughout Q. we shall decompose Q into 
sub-cubes of side- length 2~'-''°R and decompose </> and -0 into smaller waves, each 
of which is concentrated on one of these sub-cubes. By an interpolation between 
bilinear and estimates as in Wolff [25], the cross-terms are well controlled, 
and one can replace 4> and by a "quilt" of waves on the 2~'-''°R-cuhes. One then 
applies the induction hypothesis to each sub-cube and sums up. 

This tactic works well when and ijj are disperscxi, but there is a problem when 
almost all the energy of (j) and tjj simultaneously concentrate in a disk D of radius 
r <^ R. By Huygens' principle the wave (pip is then concentrated in the double 
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light cone generated by D. Restricting to this smaller set, we can exploit a more 
favourable bilinear estimate (Corollary 13.2) than the more trivial bilinear 
estimate (27) used in the non-concentrated case. One can then repeat the non- 
concentrated argument, and localize ^ and tp all the way down to the scale of r, at 
which point the waves become non-concentrated and one obtains enough of a gain 
to close the induction. 

The proof is unfortunately rather complex. In an attempt to give the reader 
a sense of the full argument without drowning in the details, the author has ab- 
stracted the argument into several sections. We first give the top-level argument, 
in which Theorem 1.1 is deduced from several major propositions. Then, we give 
the medium-level argument, in which these major propositions are deduced from 
some elementary estimates and a key estimate, Proposition 4.1. Finally, we devote 
several sections to the proof of Proposition 4.1. We have tried to make each section 
as self-contained as possible, so that the arguments in each section rely only on 
the Propositions and Lemmata of the previous sections, and not on the method of 
arguments or on notation specific to a single section. 

The author wishes to thank Damiano Foschi, Sergiu Klainerman, Christoph 
Thiele, and Tom Wolff for many helpful discussions, and for making their preprints 
readily available. This work was conducted at UCLA and at UNSW, with ad- 
ditional support from NSF grant DMS-9706764 and grants from the Sloan and 
Packard foundations. The author is also indebted to the referee for finding some 
errors in the first version of this paper (which claimed (3) without the 2^*^ factor). 

2. Some notation 

In this section we list some notation which will be used throughout the argument. 

We fix A'' » 1 to be a large integer depending only on n {N = 2" will suSice); 
the disclaimer "assuming N is sufficiently large" will be implicit throughout our 
arguments. We also let < e <C 1 be an arbitrary small number. We also let Co 
denote an integer much larger than N, 1/e; for instance, 

(5) Co := 2W^J" 

will suffice. Generally speaking, we use N as a large exponent, and Co as a very 
large constant (large enough to dominate any reasonable quantity arising from N 
or e). As a rule of thumb, any term containing an or factor may be 

ignored for all practical purposes; these error terms only arise because one cannot 
quite simultaneously localize in both space and frequency. 

We use the notation A := B to indicate that A is being defined to equal B. We 
shall extend this notation in several ways, for instance / := g indicates that / is 
being defined via the Fourier transform. 

We let C denote various large numbers that vary from line to line (possibly 
depending on A^, e, but will not depend explicitly on Co), and let A < B or 
A = 0(B) denote the estimate A < CB where C depends only on n and e. Similarly 
we use A <^ B to denote A < C~^B. Generally speaking, we shall use the < 
notation to control error terms, but will need the more precise < notation for the 
main terms due to the inductive nature of the argument. In particular, we will be 
dealing with many estimates of the form A < B + CE, A < {1 + Cc)B + c~'-' E, 
ov A < (1 + CNc)B + c~^E, where B is the main term, E is the error term, and 
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< c <C 1 is sonic small parameter which we may optimize in later. For such 
estimates it will be very important that the factor in front of B is either 1 or very 
close to 1, as we will be unable to close the induction otherwise. 

Note that our constants C are independent of the dimension of the spaces H, 
H' . This will be important for the induction argument. 

If <f){x, t) is a function of both space and time, we use to denote the spatial 
function <f){t){x) := 4>{x,t). 

We use the hat notation for the spatial Fourier transform 

as well as the spacetime Fourier transform 

'^(^'^) f „^,e-'"'^"'^+*"V(a;,i) dxdt, 

with the meaning being clear from context. We define the frequency support supp(/) 
of a function / to be the support of the Fourier transform /. 

We shall always treat R""*"^ as endowed with the Euclidean metric and never with 
the Minkowski metric, so that terms such as length \x\, angle Z{x,y), etc. retain 
their usual meaning in R""*"^. On the other hand, we will employ the Lorentz 
transforms on occasion, especially when we derive null form estimates in Section 
17. 

We now define some familiar geometric objects, namely disks, cubes, cones, and 
conic neighbourhoods. 

A disk will be any subset D of R""*"^ of the form 

D = D{xD,tD\rD) ■■= {{x,tD) : \x - xd\ < td} 

for some {xD,tD) G R"^^ and ro > 0. The reader should note that disks are 
n-dimensional objects even though they reside in R"^^. We call to the time co- 
ordinate of the disk D. If D is a disk, we define the cutoff function xd on R""*"^ 
by 

(6) xnix,t):={l + \^^r--. 

li D = D{x]j,t]j-.r]j) is a disk and c > 0, we define cD to be the disk cD = 
D{xD,tD',crD), and the disk exterior D^^'^ = Z)'^^*(a;£), t/j; ru) to be the region 

D'''^\xD,tD;rD) ■■= {{x,tD) •■ \x-xd\ >rD}- 

We endow disks and disk exteriors with spatial Lebesguc measure dx. 

We define Q{xQ,tQ;rQ) to be the n+ 1-dimensional cube in R"+^ centered at 
{xQ,tQ) with side-length rg and with sides parallel to the axes. We call the interval 
[tQ - rQ/2, tq + rQ/2] the lifespan of Q. If Q = Q{xQ,tQ: rg) is a cube and c > 0, 
we use cQ to denote the cube cQ := Q{xQ,tQ;crQ). Finally, we define the cubical 
annuli Q""" (xg , i g ; n , r2 ) by 

Q"""(a;g,tg;ri,r2) := <3(xq, tg; r2)\Q(x'g, tg; ri). 
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Let S = S„ denote the spatial region 

S:=U:i<|^|<4,Z(e,eO< J}. 

If {xo,to) € R""''^, define the red cone with vertex {xo,to) to be the set 
C^^ixo, to) := {(a;o + rw,to - r) : r G R,cu G 5"-^ n S}, 

and the blue cone at this vertex to be 

C''^"''{xo, to) := {{xo + ru,to + r) -.r eR.,u! e S"'^ n E}. 

For any r > 1, wc define C'^'^ixa, to:r), C^'"«(.to, to; r) to be the r-neiffhbourhoods 
of C"''^''(a;o,to), C'''"'^(a;o, io) respectively. Finally, we define the combined conic 
neighbourhood C^'"'"^''^(a;o, to; r) to be 

"'-f (xo , to ; r ) := C"^^'' (xo , to ; r ) U C"'"^ {xo , to ; r) . 

For any integer j, let Dj denote the dilation operator 

'Dj(j){x,t) := ^(2%,2^t), 

and T to be the time reversal operator 

T<i){x,t) := (l){x,-t). 

The operator T maps red waves onto blue waves and vice versa, while the operator 
D j maps waves of frequency 1 onto waves of frequency 2-' . 

In our induction argument we shall frequently be decomposing red and blue 
waves into smaller waves. Unfortunately, these decompositions often enlarge the 
Fourier support of the waves slightly, which is a potential obstruction to closing the 
induction. To get around this we introduce the notion of the margin margin(0) of 
a wave (p. More precisely, if ^ is a red wave of frequency 1, we define margin(0) to 
be the non-negative real number 

margin(^) := dist(supp(<^), 

where T,^ denotes the light cones 

E± :={(^,±|^|):^eR"}. 

We extend this definition to general red or blue waves by the formulae 

margin(TD/t;(j!)) := margin(Dfc(j!)) := margin(^). 

Thus, for instance, if V' is a blue wave of frequency 2*^, then 

margin(Vj) ;= 2-'=dist(supp(v3), 2* (E-\S""'-')). 

The concept of margin is only needed to overcome the technical obstruction men- 
tioned earlier, and otherwise plays no role of importance. 
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3. The top-level proof of Theorem 1.1 

In this section we give the top-level proof of Theorem 1.1, in which wc state the 
key propositions and give the inductive argument which yields the Theorem from 
these propositions. We shall prove these propositions in later sections. 

Throughout the proof ri>2, fc>0, e>0 will be fixed. It suffices to verify the 
case k > 2*^0 , since the k < 2'^o case follows by applying a Lorentz transform. 
We shall thus assume k > 2'-^o throughout the argument. In particular, any factor 
depending on k will dwarf any factor depending on Co, which in turn dwarfs any 
factor depending on N, which in turn dwarfs any absolute constants. 

Wc can also assume that A; is a multiple of Cg. This is a purely notational 
convenience that we shall only use in Section 16. 

It is known (see e.g. [2], [11], [22], [16], [7], or Lemma 14.2 below) that 

(7) Un2<E{cl>)y'E{^)y\ 

Thus Theorem 1.1 holds for p = 2. By interpolation it thus suffices to prove the 
theorem when 

(8) p = p, = -—, 

and we shall implicitly assume this throughout the proof. 

Definition 3.1. For any R > CqI^ ^ we define A{K) to be the best constant for 
which the inequality 

(9) mu^^Qn) < A{R)E{cf>y/'E{i;y/' 

holds for all spacetime cubes Qn of side-length R, red waves ^ of frequency 1, and 
blue waves tp of frequency 2*^ such that one has the margin requirement 

(10) margin(^), margin(V') > 1/100 - (2'^/i?)^/-^. 

The waves ^, ip may take values in arbitrary finite-dimensional Hilbert spaces. 
Note that it suffices to verify (9) for those ^, tp which obey the normalization 

(11) E{cl>) = E{i,) = 1. 

Prom (7) it is clear that A{R) is finite for each R. The margin requirements on 
A{R) are technicalities which are needed for the induction on R to work properly, as 
many of our decompositions will decrease the margin of </> and ip slightly. However, 
we may remove the margin requirements by a finite partition of space and frequency, 
and some mild Lorentz transforms to obtain the crude estimate 

(12) \mQ,<A{R')E{cf>y/'E{i;y/' 

for any cube Qr of side-length R, any R' R, and any red and blue waves ^, ip of 
frequency 1, 2*^ respectively. In particular to prove (3) it suffices to show that 

(13) A{R) < '2CC^2^i'2^^\-\) 
uniformly for all R> Co 2'°. 
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Because of the increasingly strict margin requirements as i? ^ oo we see that 
A{R) is not necessarily increasing in R. We therefore define the auxiliary quantity 
A{R) for all R > Co2^ by 

'A{R) := sup A{r). 

Co2''<r<R 

We now begin the proof of (13). In Section 7 we shall prove (13) when R is close 
to 2^=: 

Proposition 3.2. For any R > Co2^, we have the bound 

A{R) < 2'="='°{R/2''f2^'^p-^\ 

This proposition is needed to begin the inductive argument. The exact power of 
{R/2^) is unimportant as we shall soon improve this bound substantially. 

In Section 8, we adapt the localization ideas from [25] to prove the following 
recursive inequality on A{R). 

Proposition 3.3. Let R > 2Cq2^ and < c < 2"*^", and let (j), tp be re and blue 
waves of frequency 1 and 2*^ respectively which obey the relaxed margin requirement 

(14) margin{(i)), margin{tp) > 1/100 - 2{2^ /Rfl^. 
Then for any cube Qr of side-length R we have 

(15) WmL^iQu) < [(1 + Cc)A{R/2) + c-^2'=(F-i)]E(<^)i/2ij(.^)i/2. 
In particular, we have 

A{R) < (1 + CNc)A{R/2) + c"^2*=<p"5). 

The slight relaxation of margin requirements in Proposition 3.3 (as compared to 
(10)) is important for technical reasons, but should be ignored for a first reading. 

The loss of CNc in the main term comes about because we need to divide an R- 
cube into sub-cubes at various scales, and then shrink those cubes further by about 
c in order to create some separation between those sub-cubes (this is necessary, 
otherwise the error terms can blow xvp). In practice we can always optimize c so 
that its impact on the argument is negligible. 

By setting c = (2'=/i?) CN in Proposition 3.3, we obtain 

A{R) < (1 + CN{2^/R)rTN)A(Rl2) + C{2^ / Rf'^ 2^^^'^^ 
for all R > 2Co2''. Iterating this and using Proposition 3.2 when R ~ 2*^, we obtain 

(16) A{R) < 2^^°{R/2''y/^2'''^^-i^ 

for any R > Cq2}^ . This can then bo used to obtain Theorem 1.1 for p > Pq (cf. 
[25], [23], [2]), but we shall not do so here. 

Prom the preceding discussion we observe that (13) is already proven for R < 
2Cofe Ti^^ns -y^rc jnay assume that R > 2*^"'^. R is now very large, dominating most 
quantities which depend only on k, Co, N, and e. 

In the large R case we need to introduce the notion of energy concentration. 
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Definition 3.4. For any r > 0, spacetime cube Q, red wave </>, and blue wave ip, 
we define the energy concentration E^^q ((/>, V') to be the quantity 

Er,Q{<j>,^j:) ■.= v^B^{\E{<j>)y^E{'il;Y'\suML^D)\mL^D)) 

^ D 

where D ranges over all disks of radius r whose time co-ordinate tj? is contained in 
the lifespan of Q. 

Clearly we have Er^Q((j),%l)) - £;(?/;) 1/2 for any r, with equality or near- 

equality only occurring when 0, 1}) simultaneously concentrate in a disk of radius 
r. However, the choice of whether to use Er^Q{4>,i}) instead oi E{(t>Y''^ E{i}Y/'^ will 
be crucial to make a certain induction work. 

Roughly speaking, the strategy for the R > 2'-^"'^ case is to show that Proposition 
3.3 can be improved slightly unless there is substantial energy concentration. This 
slight improvement will be enough to close the induction, however, one must still 
deal with the concentrated case. In this case we use Hiiygens principle to restrict 
ourselves to a small region of a double cone CJP'^'^p^'' , in which case one can obtain 
an improvement of Proposition 3.3 by other means. 

To make this strategy more precise we shall need the following technical variant 
of A{R) which is weighted slightly to exploit the gain in the non-concentrated case. 

Definition 3.5. For any R > 2'-^"'^ and any r',r > 0, we define A{R,r,r') to be 
the best constant for which the inequality 

(17) 

ll</'V'IU.(QHnc.-.<e(,„,to;r')) < A{R,r,r'){E{<l>y/^E{ij)'/y/PEr 

holds for all spacetime cubes Qr of side-length R, all {xo,to) G R""*"^, red waves 
(j) of frequency 1, and blue waves ip of frequency 2* such that one has the margin 
requirement (10) and the energy bound (11). 

As before, it suffices to verify (17) assuming the energy normalization (11). It 
is important that the right-hand side if (17) is exactly as stated, and not (for 
instance) the comparable quantity A{R,r,r')E{(f)Y/'^ Ei^ipY^"^ . In practice r and r' 
shall usually be comparable in size. 

The preceding heuristics regarding concentration can be formalized in the fol- 
lowing Proposition, which we prove in Section 11. This Proposition is basically an 
application of Huygens' principle, combined with some more sophisticated argu- 
ments to deal with the highly concentrated case r < 

Proposition 3.6. For any R > 2^°*^, we have 

A{R) < (1 - C^^) sup ^(i?,r,Co(l -hr)) -h2'^<^°2^*=2'=(i-i\ 

The requirement < j. jg somewhat difficult to obtain, but it is necessary 

to do so because the tools we shall develop to control A{R,r,r') have a spatial 
uncertainty of about \fR and so one cannot effectively exploit any concentration 
effects near or below this scale. This uncertainty of \/~R, is responsible for all the 
powers of r^/^ and R^^^ which appear in the arguments; these powers should be 
ignored for a first reading. The key point to observe in Proposition 3.6 is that we 
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have somehow wrested a small gain (1 - Cq-'^) from the main term on the right- 
hand side, thanks to the beneficial effects of non-concentration in (17). This gain 
will allow us to absorb all error terms and (1 + Cc) factors in the other Propositions 
in this section, thus closing the induction. 

To use this inequality inductively we need to bound A{R, r, r') in terms of A{R). 
From (12) it is easy to show that A{R, r, r') < CA{R), but this is too crude to close 
the induction, and one must take some more care with the constant in the leading 
term. 

The bounding of A{R, r, r') by A{R) can be split into two stages. First, we shall 
use finite speed of propagation in Section 12 to observe that one can obtain the 
desired bound in the non-concentrated case r > Cq R: 

Proposition 3.7. For any R > 2'^°'', r > C^R, r' > 0, and < c < 2'^°, we 
have 

A{R, r, r') <{1+ Cc)A(Rj + c-^2'=(^-i\ 

For the concentrated case we iterate the following Proposition, which is proven 
in 9. 

Proposition 3.8. For any R > 2'^°'' and C^R >r> i?V2+3/JV^ ^^^g 

A{R, r, r') < (1 + CNc)A{R/Co, r(l - Cr-^/^^), r') + c-^{l + -^)-^'''2''^^-^^ 

for any < c < 2''^° . 

The decay of (1 + 2^)^^^^ in the error term is crucial for this cndpoint result 
as it allows us to avoid losing a logarithmic factor log(i?/r) in the iteration, which 
would otherwise be fatal to the proof of the endpoint. This decay ultimately arises 
from the improved energy estimates on cones as encoded in Lemma 13.1. However, 
the presence of the 2'^ means that we still lose a factor of k when summing over 
scales R, and this is the main source of the 2^*^ loss in (3). 

These two Propositions combine to give 

Corollary 3.9. For any R > 2*^°'= and r > i?V2+4/JV^ ^g ^^^g 

A{R, r, Co(l + r)) <{1 + CNc)A(Rj + c-^2^'=2'=(^-5) 
for any < c < 2~<^o. 

Proof We may assume that r < CqR since the claim follows from Proposition 3.7 

otherwise. Let J be the first integer such that r > 2~^Cq R; from the hypotheses 
we have J < log(r). Define r =: ro > ri > . . . > rj inductively by r^+i := 

rj(l — Crj~^^^^). One can verify inductively that rj = r + 0{jr~^/'^^) for all j, 
and in particular that rj ~ r. 
From Proposition 3.8 we have 



A{R/2^, r,-, Co(l + r)) < (1 + CNc,)A(R/V+\rj+^,Co{l + r)) 
for any < c^- < 2-<^°. 



+ c7^(l + ^)-i/iV2fe(|-i) 
J ^ 2i2^r' 



10 



TERENCE TAG 



Observe that 

R 

If we thus set c, := k~^c2~'^'^~^~''^+^^'^^^^ for a suitable constant C, we thus obtain 



l + _^>C-C2(^-i-fe)+. 



A{R/2\rj,Co{l+r)) <(1 + C7Vfc-ic2-(-^-^-'=)+/(2C^^))A(i?/2^+i, r^+i, Co(l + r)) 

+ fcC^-C2-(J-j-fe)+/2Ar2fe(^-5). 

Iterating this, one obtains 

A{R, r, Co(l + r)) < (1 + CNc)A{R/2'^, rj, Co{l + r)) + c-^2''^^-^\ 
The claim then follows from Proposition 3.7. ■ 
Combining this with (3.6) and setting c := 2~^° , we thus obtain 



A{R) < (1 - C^^)A{K) + 2CC<'2^'=2'=(5-5 



-'0 

for all R > 2'-^ffe_ Combining this with (16) we thus see that this inequality thus 
holds for all R > Co2'^. Taking suprema and using the monotonicity of A{R) we 
thus obtain 

'A(R)< (l-Co-'=')A(fi) + 2'^<='«2'^'=2'=(p-5) 
for all R > Co2'^, which implies 

A{Rj <2^'^'>2^''2''^p-i\ 

and (13) follows for all R > Ca2^ as desired. 

It remains to prove Propositions 3.2-3.8. It turns out that most of these propo- 
sitions follow as consequences of a localization property for red and blue waves, 
Proposition 4.1, which wc shall state in the next section, after some notation. In 
Sections 7-12 we show how this Proposition implies Propositions 3.2-3.8. Finally, 
in Sections 13-16 we give a proof of Proposition 4.1. 



4. The main proposition 

In this section we state the main proposition of the argument. Proposition 4.1. 
As we shall see in the next few sections, this Proposition will be the main tool used 
to prove Propositions 3.2, 3.3, and 3.8, and also plays a minor role in the proof of 
Proposition 3.7. 

Proposition 4.1 involves the localization of waves (j), tp on a large cube Q into 
smaller waves localized on sub-cubes of Q. To make this precise we must introduce 
some notation. 

If <5 is a cube of side-length R, and j > is an integer, we may partition Q into 
2in+i)j py^jgg Qf side-length 2-^i?; we use Qj (Q) to denote the collection of these 
cubes. 

If (5 is a cube and j > is an integer, we define a red wave table <p on Q with 
depth j to be any red wave with the vector form 
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where the components i/)^'-' may themselves be vector- vahied. If < j' < j, and 
q' € Qj'{Q), we define ^ to be the red wave table on q' with depth j — j' given 
by 

:= 0(9) for all q € Qj-j'{q'). 

Note that 

E{cl>) = J2 ^(^^"'^^ 

for all < j' < j. 

If (/) is a red wave table on Q of depth j and < j' < j, we define the j' -quilt 
of (j) to be the non-negative function 

Note we have the pointwise estimates 

(18) |0(«)|x,<['/'b<[0],-i-.-<Mo = lto 

for all q G Qj{Q). We define blue wave tables and their quilts analogously. 

The estimates (18) are of course very crude, and wc shall frequently be exploiting 
various improvements to this estimate in the sequel. 

If Q is a cube, /c > is an integer, and < c <C 1, we define the {c,k) -interior 
r-^{Q) of Q by 

(19) P'\Q):= U (l-c)g. 

9eCfc(Q) 

The advantage of working with I'^''^{Q) instead of Q is that the sub-cubes of P'''{Q) 
have some significant separation properties. 

With these notational preliminaries, we are now ready to state Proposition 4.1. 

Proposition 4.1. Let R > CqI^ , < c < 2^*^", and let (j), ip be red and blue waves 
with frequency 1 and 2'^ respectively, which obey the energy normalization (11) and 
the relaxed margin requirement (14). For any cube Q, define the set X{Q) d Q by 

(20) X{Q) := n I^'-"'-'""'={Q). 

j=Co 

Then for any cube Q of side-length CR, we can find a red wave table ^ on Q with 
depth k and frequency 1, and a blue wave table ^ onQ with depth Co and frequency 
2^ , such that we have the margin estimates 

(21) margini^), margin{^) > 1/100 - {2^+^° /Rfl'^ , 
we have the energy estimates 

(22) E{^),E{^)<l + Cc, 
and we have the inequality 

(23) UnLnxm) < ll[*]fc[*]colUnx(g)) + c-^2'=(i-5). 
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For any cone C^'^P''^{xo,to;r) with r > 1, we may improve (23) to 

\\<P'4>\\LP{X{Q)nCP»'-P^^{xo,to;r)) < II [*]fe [*]Co IUp(X(Q)nCP'"-P'«(a:o,to;'-)) 

(24) +c-^2'=(^^)(l + A)-i/-. 

Z'^r 

Furthermore, we have the persistence of non- concentration 

(25) i?.(l-Cor- V4«),2q($, *) < i^r,2Q(<^, ^)+Cc+ c'^ii^-^/^ 

for allr>R}'^+^l''. 

The reason why we use X[Q) instead of Q is that the sub-cubes of X{Q) have 
some non-zero separation between them, which will let us avoid some unpleasant- 
ness in Section 15. To return from X{Q) to Q we shall use an averaging lemma, 
Lemma 6.1; this causes the loss of Cc in many of the Propositions stated previously. 
The estimates (22), (25) are stating that ^ are "smaller than or equal to" 0, in 
energy norm, while (23), (24) state that the quilts [<&]fe7 [^]co ^re good approxima- 
tions for (f), ijj respectively. This replacement of waves by quilts of essentially equal 
or lesser energy allows one to induct efficiently (providing that one has constants 
close to 1 in the main terms). 

Note that the margin requirements on 0, i/; are slightly weaker than those in 
Definition 3.5. The gain of (1 + 2^)"^/^ in (24) over (23) is responsible for the 
corresponding gain in Proposition 3.8 as compared against similar estimates such 
as Proposition 3.3. This gain is essential as it needs to compensate both for the Cc 
loss in (22) and for the log(i?) loss that would otherwise arise from an induction on 
scale. 

Interestingly, the low-frequency wave (j) can be localized much further than the 
high-frequency wave V'; the former can be localized to cubes of 2"*^ the side-length, 
whereas the high-frequency waves can only be localized by about 2~^° before the 
error estimates begin to deteriorate logarithmically. This behaviour is also seen in 
the example given in the introduction demonstrating that the power of k in (3) is 
essentially sharp. In applications we shall only need to localize to 2"*^° , however 
in the proof of Proposition 4.1 we shall need to localize </> all the way down to 
before one can begin to localize 

The proof of Proposition 4.1 is the longest part of the argument, and is basically 
a "pigeonhole- free" version of the arguments in Wolff [25] . Since it is the statement 
of this Proposition, rather than the method of proof, which are required to prove 
the Propositions of the previous section, we shall defer the proof of Proposition 
4.1 to sections 14-16. For now, we devote ourselves to the question of how this 
Proposition, combined with some other more elementary tools, can be used to 
prove Propositions 3.2-3.8. 

5. Energy estimates 

In this section we record some fairly easy energy estimates which will be used 
throughout the paper. 

Let 0, 'ij) be red and blue waves. By integrating (1) along the life-span of a cube 
we see that 

(26) ML^iQ^ < R"^E{<j>f/\ UWl^^q,) < R"^E{i,f'^ 
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for any cube Qr of side-length R. By Holder's inequality we thus have 
(27) \m\LHQn)<RE{<t>f'^E{i,f/\ 

To compensate for the loss of R in (27) we shall often seek variants of (7) in 

which one has a gain gain exactly balances the loss of R when 

interpolated at the exponent p = po- One such example of this gain is Lemma 14.2; 
see also (54), (108). 

To tackle the case fc ^ 1 of widely differing frequencies we shall need the following 
improvement of (27) when (j) is replaced by a quilt. 

Lemma 5.1. Let Qr be a cube of side-length R > 0, j > is an integer, 4> be a 
red wave table on Qr with depth at least j, and tjj be a blue wave. Then 

IIMill2<2-^/^i?V2i;(0)V2, 

and 

ll['/'L-^lli<2-^/'i?s(0)V2^(V,)V2. 

Proof Let q e Qj{QR). Prom (26) for ^(s) have 

ll<^^^^IU=(,)<2-^-/2^1/2^(^(,))l/2_ 

Square summing this in q we obtain the first estimate, and the second estimate 
follows from (26) for ■0 and Holder's inequality. ■ 

This improvement over (27) is one demonstration of the gain involved when pass- 
ing from a wave to a quilt. The other major advantage of quilts is that they allow 
one to localize estimates on large cubes to estimates on small cubes so efiiciently 
that one can make induction on scale arguments work. 

6. An averaging lemma 

The following averaging lemma will be needed in the proofs of Proposition 3.2, 
3.3, 3.8. It allows one to average out a bounds on X{Q) to obtain a bound on Q. 

Lemma 6.1. Let R > 0, < c < 2~'-^°, and F be an arbitrary smooth function. 
Let Qr be a cube of side-length R. Then there exists a cube Q of side-length CR 
contained in C^Qr such that 

||i^lU.(Q«)<(l + C7Vc)||F|U.(x(Q))), 
where X{Q) is the set in (20). 

Proof By the pigeonhole principle it suffices to show that 

ll^lli''(QH) - |Q^_^^(l + <^^)^ll^llinQ«n(X(Q(:r„,io;C7JV.)))) dxodto- 
From the symmetry and translation covariance of X{Q) we have the identity 

/ \\nUQ^nixiQM;CRm dxodto = [ \F{x,t)f\X{Q{x,t;CR))nQR\dxdt. 
JQr JQr 

On the other hand, from (19) we have 

|Q(xo, io; CR)\r^''{Q{xo, to; CR))\ < c\Q{xo,to; CR)\ 
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SO from (20) we have 

\Q{x, t; CR)\X{Q{x, t; CR))\ < Nc\Q{x, t; CR)\ 

and thus that 

\Qr\ < (1 + CNc)\X{Q{x, t; CR)) n Qr\. 
The claim then follows. ■ 

7. Proof of Proposition 3.2 

We now have enough machinery to prove Propositions 3.2, 3.3, and 3.8. We 
begin with Proposition 3.2. 

Fix R > Co2'^, and let 0, tp be red and blue waves of frequency 1 and 2*^ 
respectively, obeying the margin requirement (10) and the energy normalization 
(11). Let Qn be a cube of side- length R. To prove Proposition 3.2 it suffices to 
show that 

ll'/'V^llL.(Q«)<2^^"(i?/2'=)^2'=(^-i). 

Set c := 2"^", so that 1 + CNc ~ 1. From Lemma 6.1 with F := (pip, we may 
find a cube Q of side-length CR such that 

Uvhp(QH) ^ II^V'IIlp(x(q))- 

Let ^ be as in Proposition 4.1. By (23) and (18), we thus have 
By (18), (7) and (22) wc have 

wmk^h < ii$*ii2 < E{^y/^E{^y/^ < i, 

while from Lemma 5.1 and (22) we have 

||[*]fe*||i < 2-'=/2i?£;($)i/2^(^)i/2 < 2-fc/2iJ = (i?/2'=)2'=/2. 
Interpolating between the two, one obtains 

and the claim follows. ■ 

8. Proof of Proposition 3.3 

Let 0, i/i, Qij, c be as in Proposition 3.3. We may assume the normalization (10). 
We may assume that R > 2'~^^'>2'' for some large C, since the claim (15) follows 
from Proposition 3.2 otherwise. 

From Lemma 6.1 with F := <pip, we may find a cube Q of side-length CR such 
that 

\\H^\\LP{Qn) < {^ + CNc)\\cl>^\\LP^xiQ))- 

Let ^ be as in Proposition 4.1. By (23), we thus have 

WHh.iQ^) < (l + C7Vc)||[<i>]fc[vI/]colU.(x(Q))+c-^2'=(^-^). 
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By (18) and the hypothesis k ^ Co, we thus have 

(28) ||<^VI|l.(q«) < (l + C7Vc)||[$]co[*]collL.(Q)+c-^2'=(i-^). 

We expand 

qeQc„(Q) 

By (9) and (21), then the inclusion P C l^, followed by Cauchy-Schwarz and then 
(22) we thus have 

\mCo{^]Co\\L^iQ)<A{2-^oJi)^ i5($(9))f/2£;(*(«))P/2)l/P 

qeQcoiQ) 

< A{R/2) £;($W)V2^(^(9))i/2 

geQcoiQ) 

<A{R/2){ i;($(«)))i/2( ^^^{g)^y/2 

qeQco(Q) qeQcoiQ) 
= A{R/2)E{<py^^E{^y/^ 

< (1 + Cc)A{R/2). 

Inserting this back into (28) we obtain the desired estimate (15). ■ 

9. Proof of Proposition 3.8 

We now prove Proposition 3.8. This will be a reprise of the argument used to 
prove Proposition 3.3, but will use non-concentration to improve upon the inclusion 
F C used previously. 

Fix R > 2'^'''=, < c < 2-'^o, r' > 0, and C^R >r> R^/'^+^'^ . Let (j), V be red 
and bhic waves of frequency 1, 2^ respectively obeying (14). We may assume the 
normalization (11). Let Qr be a spacetime cube of side-length R, and let (a;o,io) 
be a point in spacetime. To prove Proposition 3.8 it suffices to show that 

ll</'V'l|L.(Q„nCP-.'=(.o,fo;r')) <(1 + CNc)A{R/Co,fy)Er,CoQn{^,i^Y''' 

+ c--(l + ^)-V-2'=(i-i) 

where f := r(l — Cor~^^^^). Here we have used the fact that E^^CoQa (</>; V')^^^ ~ 1- 
Applying Lemma 6.1 with 

we see that there exists a cube Q of side-length CR contained in C^Qr such that 

ll'/'V'llLP(QHnCP"'-P''=(rrQ,to;r')) (1 + c) || f/)?/;!! LP(X(Q)nCP"'-P''=(a;o,to;r')) ' 

Let be as in Proposition 4.1. By (24) and the inclusion 2Q C CoQr, it thus 

suffices to show that 

il mk^Co ilLP(X(Q)nCP"-P-(-o,to;r')) <(1 + Cc)AiR/Co, f, r')Er,2Q{<P, ^Y'"' 
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since the latter factor can easily be absorbed into c '^(1 + ^/^2*'^p since 

R > 2^0''. 

By (18) we can estimate by ['I'lc'o- By (25) it thus suffices to show that 

!l[*]c„[4']c„||L.(Qnc."'-.-(.„,t„;,.')) < {l + Cc}A{R/Co,f,r')E,^2Q{<^,^Y/p'. 

Here we have used some trivial bound in A{R/CQ,f,r') which is polynomial in R, 
e.g. by (7) and Holder. 

Raising both sides to the p*'' power and expanding, we reduce to showing that 

(29) Yl ll*^''^*^'^lli.(,)<(l + <:?c)A(i?/C'o,r,r')^£;?,2Q($,*)^/^'. 

qeQc„{Q) 

The cube q has side-length 2-'^°CR < R/Cq, so in particular CqQ C 2Q. By (17) 
and (21) we thus have 

||*'''^**'''||Lp(9)^(^/Co,r,r')(i^(4'<'')^/^i^(4'^''^)^/^)^''Pi^f,2g($^''\**'')^/^'- 
Also, by (22) we have 

Ef,2Q{^'^''\'f^^'>) < (1+Cc)S,,2q($,*). 
Comparing these estimates to (29), it suffices to show that 

Y £;($(«))i/2^(*(«))^/2 < 1 + c'c. 

geQcoiQ) 

But this follows from Cauchy-Schwarz and (22) (cf. the proof of Proposition 3.3). 



10. Spatial localization, and Huygens' principle 

We shall shortly begin the proof of Propositions 3.6 and 3.7, but we must first 
develop some machinery concerning the localization of a wave to a disk, and the 
resulting estimates arising from Huygens' principle. This machinery will also be 
useful in proving the persistence of non-concentration estimate (56) in Section 15. 

To localize a wave to a disk we shall need a bump function r]o and an evolution 
operator U{t), which we now construct. 

Let r]o to denote a fixed non-negative Schwarz function on R" with total mass 
1 and whose Fourier transform is supported on the unit disk; such a function can 
be constructed for instance by r^o = * '^j where (p is a. real even bump function 
supported near the origin such that (p{0) = 1. For any r > we define ijr by 

(30) Vrix):=r-"voi-)- 

r 

The function rjr is thus an L^-normalized Schwarz function concentrating on an r- 
disk with good frequency localization properties. We shall use this cutoff function 
both in this section and in the construction of the wave packet decomposition in 
Appendix I. 

Fix a(^) to be a bump function supported on E which equals 1 on the spatial 
projection of S'''^'^ and S'''"''. We define the evolution operator U{t) = U'^^'^{t) by 



(31) 



mf(.0 :=«(Oe'"'*'«'/(C) 
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and the kernel Kt by 

Kt{x) j a(C)e2'^'(^-€+*l€l) d^. 
We have the propagation law 

(32) 4>{t) = J7(t)0(O) = </.(0) * Kt 

for all red waves <^ with frequency 1, and all times ti,t2- A stationary phase 
computation gives the pointwise estimate 

(33) \Kt{x)\ < (l + dist((,x,i),C"-^'^(0,0))-^". 

We shall also use the operator U {t) in the proof of an energy estimate on cones 
(Lemma 13.1), and in the proof of wave packet decomposition (Lemma 15.2) in 
Appendix L 

Definition 10.1. If D = D{xD,tD',r) is a disk, and is a red wave of frequency 
1, we define Pd4> at time to W 

PD<P{tD) ■■= {XD * ??ri-i/iv)0(iD) 

and at other times t by 

PD<l){t) ■.^U{t-tD)PD<PitD)- 

Here rjr. For red waves of frequency 2^ we define Pd by the formula 

Po'Djcj, BjPd^, 
and for blue waves we define Pd by the formula 

PoTcf) := TPD(t>. 

The operator Pd localizes a wave to the disk D at time to, while 1 — Pd similarly 

localizes to D'^^*. More precisely: 

Lemma 10.2. Let j be an integer, r > Cq2~^ . Let D = D{xD,tD',r) be a disk 
with radius r, and let (j) be a red wave with frequency 2^ and margin m,argin{(f)) > 
Co(2-'V)~^+^/-'^. Then PD(f> is a red wave of frequency 2^ which satisfies margin 
estimate 

margin{PD<j>) > margin{(j)) — Co{2^r)~^~^^^^ 
and the energy estimates 

(34) UD^'Ponmo^^^^) < (2^r)-^£;(.^)V2 

(35) 11(1 - PD)cP\\mD.) < (2^r)-^£;(<A)V2 

(36) E{Pd<I>) < Mh^n^^ + Ci2^r)-''E{<I>) 

(37) E{{1 - Pn)^) < \ml.^nt^^^ + C{Vr)-^ E{<l>) 

(38) E{Pd(I>), E{{1 - Pn)cp) < E{cP) 
where are the disks 

D± :=£)(a;B,iD;r(l±(2^r)-V2JV)). 
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Note in particular that 

-D c D_ c D c D+ c 2D. 

2 + 

Proof By scaling it suffices to verify this when j = 0. The claims follow directly 
from the easily verified estimates 

< xn *r/^i-i/jv(a;) < 1, 

Xd * r]ri-i/N{x) < for x G 
1 - Cr"^ <Xd* Vr^-UN{x) for x e D"^*. 

■ 

Applying T wc sec that similar statements hold for blue waves. 

We now investigate the localization properties of Pd for times other than to- 
Heuristically speaking, Pncf) is supported on the cone C^'^'^{xD-,tD,Cr), while (1 — 
Pd)<I> vanishes on the cube Q{xD,tD', C~^r). More precisely, we have 

Lemma 10.3. Let D be a disk of radius r > Cq, and let (p he a red wave of frequency 

1 and m,argin margin{cj)) > C^r^^^^/^ . Let 1 < q < 2, R> r. 

• If ip is a blue wave of arbitrary frequency, we have the finite speed of prop- 
agation law 

(39) ||((1 - PD)m\L.iQi.o,to,C-^r)) < T^-"" E{<I>Y'^ E{^f'^ 

and the Huygens ' principle 
(40) 

ll(J'D'/')^llL.(Q(x,„t.;K)\C'-(x.,t.;C.+KV-)) <i?^-^i^(<A)'/'i^(^)'/'. 

• Iftpisa blue wave of frequency 2^ for some k>0 such that margin{tl)) 
(2'=r)-i+i/^, then we have 

(41) 

We will also need to use (39), (40), (41) with the roles of (j), tp and red and blue 
reversed. If one repeats the arguments below, one finds that one loses a factor of 
2*^^^ on the right-hand sides; however, if r > 2'^ one can absorb these losses into the 
factors of r*^"^ or R^-^. Thus we may freely interchange (j) and ip and red and 
blue in the r > 2*^ case. 

Proof By Holder it suffices to verify the claims when q = 2. 
To prove (39) wc first use (32), (33), and (35) to obtain 

11(1 - PDmL-iQi.n,tn-.C-^r)) < T^-"" E{ct>f'\ 

and (39) follows from (26). A similar argument gives 

(42) \\PDHL--(Q(xo,tD\R)\C^''^{xD,tD\Cr+m/«)) ^ R^~^ ^((pY^'^ , 

and (40) follows from (26). 

We now prove (41). From (40) with ip replaced by Poi^, we have 

||(Pz50)(/^Z5^)|L.(Q(.,,t,;K)\C^e.(.,,t,;C. + iil/«)) <^''-^i^(<^)'/'i^W'/'- 
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By applying (42) with R, r replaced by 2''R, and then applying TD^, we 
obtain 

\\PD4'\\L'^(Q(xD,tD;R)\C>''"'{xD,tD;Cr+m/")) ^ 
SO by (26) we have 

||(P,,0)(Pz,V)|L^(Q(.,,t,;K)\C-.(..,t.;C.+HV«)) <i?^-^£;(0)^/^i?(^)^/2. 

Combining this with the previous estimate we obtain (41). ■ 

The estimate (41) is one way of exploiting the transversality of red and blue null 
directions. We shall also rely on transversality in other ways, notably in Lemma 
13.1 and in Lemma 14.3. 

11. Proof of Proposition 3.6 

We now have enough machinery to prove Propositions 3.6 and 3.7, except for a 
somewhat difficult estimate. Lemma 11.1, which we defer to Appendix II. 

Fix R > 2'-'°'^, and let </>, tp be red and blue waves of frequency 1 and 2^ respec- 
tively, obeying the margin requirement (10) and the energy normalization (11). Let 
Qr be a cube of side-length R. To prove Proposition 3.6 we have to show that 

(43) 

||</>V'I|l.(q«)<(1-Co-^) sup A{R,r,Co{l + r))+2^'^o2ek2Hi-i)_ 

We may of course assume that (f), tp are nearly extremal in the sense that 

(44) mhHQ^) ~ A{R). 
We may also assume that 

(45) A{R) > 2CC'o2'=(p-5) 

since (43) is trivial otherwise. 

Let 0<(5<l/2bea small number to be chosen later. Let r be the supremum 
of all radii r > Co2'' such that 

(46) Er,CoQn{^,i^)<'^-S, 

or r = Ca2^ if no such radius exists. 

Since 0, ip are smooth and have finite energy, it is easy to see that r is well 
defined, and that one can find a disk D = D{xo,to;r) of radius r with to in the 
lifespan of CqQr for which 

(47) minmhiD),MhiD))>'^-25. 
Fix the disk D{xo,tQ;r), and define the disk D' by 

D' ■.= Cy^D = D{xo,to;Cl'''r) 
and define the double conic region O by 

f2 := Qij n C^^^^^^{xoM; Co(l + r)). 
We now show that in Qr, the expression (jyip is mostly concentrated in Q.. 
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We split (j) and ip into red and blue waves respectively as 

(P={1-Pd')<P + Pd'<P, ^ = (1-Pd')V' + -Pd'V' 

where the projection operator Pn' is as in the previous section. 
From (47), (11), and (37) we have 

E{il - Pd>)4>), E{{1 - Pd')^) < 5 + Co"^. 

Prom (12) we thus see that 

(48) ||((1 - Pwm^ - PD'ML.iQn) <{S + Co'')A{R). 

On the other hand, from (40) and its analogue with the roles of red and blue 
reversed (using the assumption r > 2'^; see the remarks following Lemma 10.3) we 
have 

\\{Pn'm\L.iQn\n), ll((l - PD')cl>)PD'nL.iQn\n),<Co''. 

Combining these estimates using the triangle inequality we obtain 

\mL^{Qn\n)<{S + Co'')A{R). 
We thus see from (44) that 

\\mLnQn\n) < {s + Co'')\\mLnQn)- 

Raising this to the p*'' power and re-arranging, we see that 

(49) mh^in) > (1 - C{6 + Co-^)^)^/^||.^^|Up(q«). 

We now divide into two cases. First suppose that we are in the medium or low 

concentration case r > In particular we have r > Co2'^, so from the 

definition of r we see that (46) holds. From the definition of A{R, r) (and the 
assumption (5 <C 1/2) we thus have 

Combining these estimates and setting 6 to some small negative power of Co we 
obtain the desired estimate (43) as desired (provided that Cq is large enough; (5) 
suffices). 

Now suppose instead that we are in the highly concentrated case r 
Define R by the formula 

R := max(2'^°'=, (r) Wiyiv). 

thus 2'-^°*^ < R< R. In analogy to the previous case, we observe that 

m\L.iQM-m<{^-Sy^'''A{R,r) 

when R > 2'-^°*^. When R = 2^"'^ one can use (13) (which has already been proven 
for this value of R) and (12) to obtain 

ll'^^llL.(Q(.o,*o;fl)nO) < 2^^°2^'=2^(i-i). 
In either case, we see that if we could show that 

(50) m\LnQ.\Qi.o,to;R)) ^ {S + Co^)AiR) 
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then we could repeat the argument in the first case and obtain (43) as desired. 

It remains to prove (50). By (48) and the triangle inequality it suffices to show 
that 

\\{PD'm\LnQn\Qi^o,wM ^ {^ + C^'')A{R) 

and 

\m-PD')4>)PD-n^.(Q.\cii.„Mm ^ {S + C^''')A{R). 
By a dyadic decomposition and (45) this will follow from 

Lemma 11.1. Let R > , andletCol^ <r< R^'^+'^l^ . LetD = D{xo,tQ,Cl'^r) 
he a disk. Then for any red wave ^ and blue wave of frequency 1, 2^ respectively 
with 

'margin{(j)) , 'margin{ip) > 1/200 

we have 

The proof of this lemma shall be postponed to Appendix II, as it requires some 
machinery which is developed later in this paper, and the arguments used to prove 
this lemma arc not used elsewhere. The estimate in Lemma 11.1 is not of an 
endpoint type, and could probably be proven by more elementary means than the 
methods used in the Appendix. 

Assuming Lemma 11.1, the proof of Proposition 3.6 is thus complete. ■ 

12. Proof of Proposition 3.7 

Let R > 2<^ofc^ < c < 2~<^o, and r > CqR. Let 4>, ip be red and blue waves 
of frequency 1 and 2*^ respectively obeying (11) and (10). Let Qji be a cube of 
side-length R. To prove Proposition 3.7, it suffices to show that 

WmL^iQn) < Er,CoQu{<P,i')'^'{^ + Cc)A(R)+C-''2''^^^-i\ 

Let {xQn,tQj^) be the center of Qr, and let D denote the disk 

D:=D{xQ^,tQ^;r/2). 

We then divide 

= (1 - Pd)^ + PD(t>, V' = (1 - Pd)^) + Poi). 

From Lemma 10.2 (especially (36)), Definition 3.4 and the hypothesis R > 2'-^°^ 
we see that Pd(I>i Pd^P obey the relaxed margin requirements (14) and the energy 
estimates 

From Proposition 3.3 we thus have 

II {Pn<i>) (PZ5 V) h^Qn) <i^ + Cc) iEr,CoQn i^l^, ^) + Ci?^"^) A(l) + C-''C^2''^ 5 " . 

The R^^'^ term can easily be absorbed into the c~'-^Cf 2'^'-p ~ 2) term by some 
trivial bound on A{R) (Proposition 3.2 will do). By the triangle inequality, we will 
thus be done if we can show 

||((1 - PD)m\LHQ.), WiPom - PDmiLHQ.) < c-^Co^2'=(f-|). 
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But these estimates follow easily from (39) and the counterpart with red and blue 
reversed (this is legitimate by the hypothesis R > 2'-''°'' and the remarks following 
Lemma 10.3). 



13. Energy estimates on light cones of opposite color 

To prove Theorem 1.1 it remains only to prove Proposition 4.1. Of course, we 
shall not use Propositions 3.2-3.8 in the proof of this Proposition. 

In this section and the next, we derive two basic estimates needed to prove 
Proposition 4.1; these are "pigeonhole-free" versions of similar estimates in [25]. 
The first estimate is an extension of (26) from a cube to a conic neighbourhood 
of opposite color. The second is a strengthening of (7) when at least one of the 
waves (j), tp has low frequency dispersion. Then, in Section 15, we combine these two 
estimates with a wave packet decomposition to derive a key ingredient (Proposition 
15.1) in the proof of Proposition 4.1. 

We begin with the energy estimate on cones: 

Lemma 13.1. Let 4> he a red wave of frequency Then for any {xo,to) e ii"^^ 
and R > we have 

Proof By translation invariance we may take {xo,to) = 0, and by scaling we may 
take j = 0. 

The blue cone is a null surface, and so standard energy estimates will not prove 
this estimate. However this can be salvaged since is red, so that the characteristics 
of (j) will be transverse to the blue cone. 

We turn to the details. Let U{t) be the wave evolution operator defined in (31). 
By (32), it suffices to show that 

( / \\XsMt)f\\l^nn^ dtf'^ < i?^/^||/|L.(Rn) 

for all / e L2(r"), where St is the slice St := {x : {x, t) e C'''"«(0, 0; R)}. 

We shall apply the TT* method. By duality the above estimate is equivalent to 

II j t/(0*(xs,F(i))rftlL^(R")<^'/'II^^IL.(R'-^)- 

We square this as 

/ / {xs.U{s)U{tnxsMt)),ns)) dtds < R\\F\\l 

where (/, g) = Jri f{x)g{x) dx is the usual complex inner product. By Cauchy- 
Schwarz and Young's inequality it suffices to show that 

\{Xs.U{s)U{tnxs.F{t)l F{s))\ <{l + \s- t|/ii)-^||i^(i)||2||F(s)||2 

for all s, t. 

When \s — t\< R this follows from Cauchy-Schwarz and the boundedness of 
the operator xs, U {s)U{t)*xSt > so we may assume that \s — t\ ^ R. The convolution 
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operator U{s)U{t)* behaves essentially like U{s — t), and its kernel Ks^t{x) satisfies 
a similar kernel estimate to (33), namely 

\K,,t{x)\ < (1 + dist((a;, s - t), C-'^(0, 0))"^". 

The claim then follows from the transversality of C"'^'*(0, 0) and C'''''^^{xQ,to) and 
crude estimates. ■ 

Applying T, we see that a similar estimate holds with the roles of red and blue 
reversed. As a corollary of both Lemma 13.1 and its time-reversed counterpart, we 
obtain the following improvement to (27) when one restricts to a double cone. 

Corollary 13.2. Let (t>, ij: be red and blue waves of frequency 1, 2*^ respectively. 
Let R > r ^ I, {xo,to) G i?""*"^, and Qr be a cube of side-length R. Then 

||<^VIU=^(C."^.-(.o.to;.)nQ«) < ri/2i?i/2i^(0)i/2ij(^)i/2. 

Proof The contribution of C'''"^(a;o, to; r)r]Qii is acceptable by using Lemma 13.1 
to control (p and (26) to control ^. Similarly for the contribution of C^'^'^{xo, to]r)ri 
Qr- ■ 



14. Bilinear L'^ estimates in the low dispersion case. 

To prove Proposition 4.1 we shall need to study waves with rather low frequency 
dispersion. To make this concept precise, we introduce the following notation, 
which we shall re-use in later sections. 

Definition 14.1. If (/) is a red or blue wave (of any frequency), we define the 
angular dispersion of (j) to be the quantity 

diam{||| : (^,r) e supp(^)}. 

A key observation (implicit in [16], and exploited explicitly in [25]) is that there 
is an improvement to (7) when either ^ ot ip has low dispersion. 

Lemma 14.2. [16, 25] Let (j) be a red wave of frequency 1, and let ijj be a blue wave. 
If 4> has angular dispersion 0(l/r) for some r^l, then 

(51) ll<^V'IL.(i2»+i) < r-("-i)/2E(<^)V2i;(v,)V2. 

Proof Let -0 have frequency 2^ . From hypothesis, the frequency support of (j) is 
contained in a a sector T of width 0(l/r). By Plancherel's theorem it thus suffices 
to show that 

\\fda^*gda2h<r-^''-'^'^\\f\\mT)\\9\\m2^,:u...) 

for all / and g, where da\ and da^ denote surface measure on T and 2*^1]'''"® 
respectively. By Young's inequality we have 

||/rfai*5rf^2||i<||/||iN|i 
so it suffices by interpolation to show that 

||/rfai*5rf^2||oo<r-("-i)||/||oo||5l|oo, 
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which by positivity reduces to showing that 

(52) ||dai*da2||oo<r-("-i). 

But this follows from the observation that the intersection of T and x — is 
always transverse and has n — 1-dimensional measure at most 0(r~^"~^^) for any 
X e R"+^ ■ 

This estimate is global in spacetime, however for our purposes it will be conve- 
nient to work with a localized form of this estimate. 

Lemma 14.3. Let j be an integer and r » 1,2~^ . Let (j) be a red wave with 
frequency 1 and angular dispersion 0{l/r), and let tp be a blue wave with frequency 
2^ such that 

margin{(l)) , margin{tp) > 1/200. 

Then we have 

(53) WHWlhq) < r-("+^)/^||0|k.(CQ)IIV'lk=(CQ) +r^-^i^(0)'/'i?(V')'/' 
for all cubes Q of side-length r. 

Proof The idea is to combine Lemma 13.1 with the localization machinery devel- 
oped in Section 10. 

Let D be the disk D := D{xQ,rQ;C^^'^r). If C is sufficiently large, then from 
(39) and (38) we have 

II ((1 - Pn)m\LHQ) < r°-''E{<l>f/^E{'^f'^ 

and 

\\Pd^{1 - PommQ) < r^-''E{<t^f'^E{^l^f'\ 
From the triangle inequality we thus have 

\m\L^iQ) < ||(Pi.</')(PBV)IL.(Rn+i)+Cr^-^£(</.)V2^(V,)V2. 

From Lemma 10.2 we see that Pd</) has dispersion 0(l/r). By Lemma 14.2 and 
(38) we thus have 

WHWlhq) < r-^'^+'^/^nmcDMlmcD) + r'^-''E{ct>)^/'E{i;y/^ 

The claim then follows by letting to range over the lifespan of Q, averaging, and 
applying Cauchy-Schwarz. ■ 

An analogue of this lemma exists with the roles of red and blue reversed, but we 
shall not invoke this analogue explicitly. 
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15. Wave packet decomposition 

Using the resuhs of the previous section, we can now prove the main tool used 
to derive Proposition 4.1. 

Proposition 15.1. Let j,j' be integers, and let R > 2^^,2^^'^^' , < c < 2''^". 
Let Q be a spacetime cube of side-length R, (p be a red wave of frequency 2^ with 
margin{4>) > (2^ J?)~^/^, and tp be a blue wave of frequency 2^ . 

Then there exists a red wave table $ = $c V'i Q) on Q with depth Cq, frequency 
2^ with margin 

margin{^) > margin{(j)) - C(2-'i?)~^/^ 
such that the following properties hold. 

• (l^]co approximates (j)) We have 

(54) \m - [a>]co)^llL^(/=.co(Q)) < c-^(|)("-i)/4£;(<^)i/2£;(v,)V2. 

• (Bessel inequality) We have 

(55) £($) < (1 + Cc)E{(j)) 

• (Persistence of non- concentration) For any r > R^/'^+'^/^2~^^^/'^~^/^\ we 
have 

(56) £^r-(l-Co(2^r)-i/=«),CoQ(*. V') < (1 + Cc)Er,CoQ{(t>, ^) 

Roughly speaking, <I>('?^ is the portion of (f) which concentrates in q. The Propo- 
sition is phrased using frequencies 2^ and 2-' rather than 1 and 2*^ in order to make 
it symmetric in (p and ip, as we shall also need the time reversal of Proposition 15.1. 
Proof The quantities r,R have the units of length, while the frequencies 2-' , 2-' 
have the units of inverse length. Finally, c and the margins are dimensionless. One 
can then verify that the entire Proposition is dimensionally correct and thus scale 
invariant. By scaling we may thus set j = 0. By translation invariance we may 
assume Q is centered at the origin. Set r := 2~'^R, where J is chosen so that 
r ~ y/R. 

We may assume that c > j?-i/io« ^fQj- instance), since otherwise the claim is 
trivial by setting $ = (and using some trivial bound such as (7) to show (54). 

Let f be a maximal l/r-separated subset of 5""^ fl S, and let L denote the 
lattice L := c~^rZ". We define a red tube to be any set T = T{lot, xt) of the form 

T = {{x, t) e R"+^ : \x - {xT + 0JTt)\ < r} 

where xt & L and lot G £. We let T = T'"^'* denote the set of all red tubes. If T 
is a red tube, we define the cutoff function xt on R""'"^ by 

(57) XT{x,t) :=XD{xT+u,Tt,t;r){x)- 

We shall need the following careful decomposition of (p into wave packets which 

arc concentrated on tubes in T. 

Lemma 15.2. With the above notation, one can find for each T G T a red wave 
(pT with frequency 1 and angular dispersion at most CR~^/'^, such that 
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• We have the margin estimate 

(58) margin{(j)T) > margin{(j)) — CR~^^^. 

• The map (pT is linear for each T, and 

(59) </, = ^ </,r. 

TeT 

• We have 

(60) E{<f>T)<c-^\\xT{t)mh 

for allT G T and t in the lifespan of CqQ. 

• // dist{T, Q) > CqR then 

(61) II<^t|Uoo(q) < dist{T, Q)^-^£(<^) V2. 

• We have 

(62) XT{x„t,)-^M\l-ic<i)^c~''rE{4>). 

• We have the Bessel inequality 

(63) {Y.^iY.^'^oMf' < {l + Cc)E{cl>f'^ 

10 TeT 

whenever go runs over a finite index set and the mq^^T are non-negative 
numbers such that 

(64) 

for allTe T 

Roughly speaking, (j)T is the portion of which has frequency support in the 
sector of width 1/r and direction {lot, 1), and is spatially concentrated in T. A 
naive microlocalization to this region of space and frequency, taking some care to 
ensure that the (/)t are still waves, would obtain most of the above properties, but 
would probably need to replace the (1 + Cc) factor in (63) by a larger constant, 
which would then cause a similar unacceptable loss in (22) and then destroy the 
induction. This necessitates a delicate construction of the 0t based on averaging. 
Because the details of the proof are technical and not particularly relevant to the 
rest of the argument, we defer the proof of this Lemma to Appendix I, and continue 
with the proof of Proposition 15.1. 

Using Lemma 15.2, we can now define $ by 



90 



65 := V 

^ ^ Tot 



for all qo & Q, where 



(66) TO,„T := llV-Xrlli^.,. + R-''^E{i;), 



ENDPOINT RESTRICTION FOR THE CONE 



27 



and 

(67) mT:= ^ m,„,T = ||VXT||i2(Q) + i?-^°"2("+i)^<'i;(V). 

qoeQcoiQ) 

The R^^'^'^E{ij)) factor is only present in (66) to ensure that m.T does not completely 
degenerate to zero. One can think of as consisting of those wave packets 0t 
such that Vxt concentrates in go- 
It is clear that $ is a red wave with 

margin($) > margin(^) — CR~^^'^ 

and that 

(68) (l>= 

qoeQcoiQ) 

The estimate (55) follows from (63). 

We now show (56). Let D = D{xo,to;r{l — Cor^^/^^)) be any disk of radius 
r(l — Cor~^/^^) with to in the lifespan of CqQ- By (55) it then suffices to show 
that 

(69) < il + Cc)Er^,CoQ{(t>,i^)- 

Let D' denote the slightly larger disk D' D{xo,h,r - ^r-^/^^), and let D" 
denote the even larger disk D" D{xq, to,r). We may divide 

Cf> = PD'(f> + {1 - PD')(f> =■■ (f>l + (l>2- 

The map ^ $ is linear, so we may write $ =: $i + $2 accordingly. From (55) 
and (36) we have 

(70) 

W^ih^D) < E{^,y/^ < (1 + Cc)E{<l>,y/^ < (1 + Cc)\mL2^D") + r''-''E{cP). 
Next, we claim that 

(71) \\^2\\L-iD)<r^-''E{cl>). 

To see this, we first consider the tubes T which do not intersect CQ. By (65), (61), 

and Holder, their contribution is acceptable. Thus wc need only consider those 
tubes which intersect CQ. By the triangle inequality it suffices to show 

ll(02)T||L^(D)<r^-^£^(<^) 

for each tube T. 

First suppose that dist(T,Z)) < iji/2+i/iooJV -jj^g^ (35-)^ (gg)^ ^^le as- 
sumption r > ii^/^+i/^ we see that 

\\XT{to)Mto)h<r^-''E{^). 

The claim then follows from (60). 

Now suppose that dist(T, D) > i?i/2+i/iooJV_ (62), the hypothesis c > R-^/^° 
and Bernstein's inequality (or Sobolev embedding) we see that 

UhML-i,) < r^-''E{ct>2) < r^-''E{ct>) 
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whenever dist(g,T) > _rV2+i/200JV rpj^^ ^^^^ follows. This completes the 

proof of (71). 

Combining (70) and (71), we see that 

mimD) < (1 + CcmiLHD") + r'-''E{^), 

and (69) easily follows (using the trivial inequality || "01^2(0) ^ This 
proves (56). 

We now turn to the proof of (54). By (68) we have 

(|<^| - [^ICo)^ < E l^^'°Vl(l-X,o), 

so by the triangle inequality it suffices to show that 

(72) ||*('''VlL2(7c,oo(Q)\,„) < c-^r-("-i)/2i;(</,)V2£;(^)i/2. 

for each q in Qco(Q)- 

Fix qo . We shall use a (heavily disguised) version of the arguments in Wolff [25] . 
If g e Qj{Q) intersects I'^'^°{Q)\qo, then dist(g', go) ^ cR. By squaring (72), we 
thus reduce to showing that 

(73) E ll^^'^Vlli^c,) < c-^r-("-i)£(</.)£;(V). 

9eQj(Q):dist(9,qro)>cfl 

Consider a single summand from (73). From (65) and the triangle inequality we 

have 

(74) ||$(*VlU2(,) < E "^Ut^lh,). 

Consider the tubes T which do not intersect CQ. By (61) and (26), their total 
contribution to (74) is 0{R^-^ E{(j)y/^ Eii;)^/^). As for the tubes T which do 
intersect CQ, we may apply Lemma 14.3 (since r 3> 1, 2^^ ) to obtain 

The total contribution of the error term to (74) is 0{R^-^ E{(t))'^^'^E{iJj)'^^'^) by 
(63) and the fact that there are only 0{R'^') tubes being summed here. Combining 
all these estimates we obtain 

II^^^^VIU^W <r-("+^)/2||^|U.(c,) E '^UT\\LHC,)+R''-''E{<l>y/'E{i^y/\ 

— JflT 

TeT 

Inserting this back into (73), we see that it suffices to show that 
(75) 

E \mUc,)i E "^WMl^c,))' < c-^r^E{<i>)E{i,). 

<?eQj(Q):dist(9,go)>c-R TeT 

Using the trivial estimate 

1/2 
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followed by Cauchy-Schwarz, we have 



By (66), we have 

TeT 

where 

TeT 

Since dist(g, go) ^ C-R, we see from elementary geometry that 

Xix, t) < c'^'il + dist((a;,^),C''"'^(a;g,t,)) ._io„_ 
From Lemma 13.1 and the triangle inequality we thus have 

Inserting all these estimates back into (75), we see that it will suffice to show that 

(76) E wW«)E=^^^-"^^w- 

leQAQ) ^^^mTXT[Xg,tg) 

We re-arrange the left-hand side as 

^ ^ ~ ^ . ^-3|u ii2 H^Hi^(Cg)Xr(a;g,tg)^ 
TeTqeQjiQ) ^ 
and estimate this by 

\l^Cg)XT{Xg,tqf 



E( sup XT{Xg,tg) ^WMhiCg)) E 

Since the inner sum is 0(1) by (67), the desired estimate (76) then follows from 
(62). This proves (54). ■ 

We define the wave table ^c{<P, i^'i Q) in analogy to $c('A, V'i Q) by time reversal: 

*,(0,^;Q) :=T$,(TVaT0;TQ). 

Of course, one has the analogue of Proposition 15.1 for \1/ but with the roles of red 
and blue reversed. 
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16. Proof of Proposition 4.1 

We are now ready to prove Proposition 4.1, in which (jnl) is replaced by a quilted 

analogue [$]fc[5']co- 

We begin with the construction of <&; this will be achieved by iterating Proposi- 
tion 15.1. 

Wc define recursively the wave tables (fij on Q of depth j for all < j < k which 
are multiples of Co (recall that we assumed A; to be a multiple of Cq in Section 3). 
Set 00 to be the wave table on Q of depth 

and then define inductively 

<^;^Co :=*c2-('-.)/«(0^'\^;'7) 

for all q G Qj{Q)- We then choose $ to be $ := 
By induction we see that 

margin(<^,) > - {2^ / Rf'^ - C{2i/Rf'^ 

for each j. This gives the desired margin requirements on $ since R 2^^. Prom 
(55) we have 

E{(t>j) < (1 + Cc2-('^-^)/^)i;((/.j_i) 

for all Co < j < k. Telescoping this, we obtain 

(77) E{(Pj) < (1 + Cc)E{ct>) = 1 + Cc 

for all j. Thus (22) holds for $. A similar iteration of (56) (absorbing any factor 
of k which appears into a small power of r^^^) yields 

(78) £;,(i_,-i/3«),Coq(*, V') < (1 + Cc)Er,CoQ{(p, i') 

for all r > i?i/2+2/iv. 

Let Co < i < A; be a multiple of Co, and let q* be a cube in Qj{Q). Prom (54) 
with R replaced by 2^~^R, Q replaced by q*, and (p replaced by (p'f \ we see that 

- [^,+CoUM\LHxiQ)nr) < c-^2^^'^-^y^{2'-=R)-'^E{cpf^y/'. 
Square-summing this in q* and using (77) we obtain 

(79) \m,], - [cp,+coUcomLHxiQ)) < c-^2^(^-^y^{2-m)-'^. 

On the other hand, from Lemma 5.1 with R replaced by 2''R and (j) replaced by 
we have 

ll[</'.L-^llLi(X(Q))<2-^/^i?. 

Replacing j by j + Cq and then subtracting, we obtain 
Interpolating this with (79) using (8), we obtain 

\\m^-[<^^+CoUcoM\\LHxiQ)) < c-^2^(*=-^-)/^2-^-(f-^) = c-^2-«(^-^-)2'=(i-^) 
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for some constant a > 0. Telescoping this with the triangle inequality, we obtain 

(80) WMlLvixiQ)) < \mkHL.{xm + c-^2>'^^-il 
Having constructed we now define ^ as 

*c($,^;g). 

The estimate (22) for ^ follows from the time reversal of (55). It remains to 

prove (23), (24), and (25). 

To prove (23), wc observe from the analogue of (54) that 

ll<f(IV'l - [*]c„)||l^(x(q)) <c-^2^^o(i?/2'=)-("-i)/4i;($)i/2£;(^)i/2. 
From (22) and (18) we thus have 

(81) 1|[$],(|^| - [*]co)|U=(x(Q)) < c-^2^^o(i?/2'=)-("-i)/l 
From Lemma 5.1 we have 

(82) \mk\\L^xiQ))<2-''R'/^ 
Prom (22) we thus we have 

\mkiA\LHxiQ)), mmiLnxiQ)) < ^-'/^r 

so from (18) and the triangle inequality we have 

\mkm-mco)\\LHxiQ))<2-''/'R. 

Interpolating this with (81), we obtain 

(83) \mM - mco)\\Lnxm < c-^2^^°2'=(^-^), 

and (23) follows from this, (80), and the triangle inequality. 

Now we prove (24). We shall assume that 2^r < R since (24) follows from (23) 
otherwise. For brevity let fl denote the region 

n:=X{Q)r\CP^^P'%Xo,to;r). 

We will need to localize (80) and (83) to O. We begin with the localization of (80). 
By (18) and Corollary 13.2 we have 

II VIUhh) < UMlho) < r'/'R'/' < {2''r/Ry/'2-iR. 
Replacing j by j + Co and then subtracting, we obtain 

- [<f>j+Co]j+ComLHn) < {2^rlRf'^2-iR. 
Interpolating this with (79) using (8), we obtain 

- ['/'i+Co],+Co)V'IU.(n) < c-^2^('=-^-)/^2-^(F-^)(2'=r/i?)V^. 
Telescoping this with the triangle inequality, we obtain 

(84) < ||[$]fc0|Up(o) + Cc-^2^^<'2'=(i-^)(2'=r/i?)i/^. 
Now we localize (83). From (18) and Corollary 13.2 we have 

||[$]fc^||Li(o),||[$]fc*||Li(n) <r^/'i?'/' = {2\/Rf 1^2-^1^ R. 
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SO from (18) and the triangle inequality we have 

- [*]co)IUi(a) < {2'^r/R)'/'2-'^/'R. 
Interpolating this with (81), we obtain 

mhrn - [*]c„)||l.(0) < (2V/i?)i/^c-^2C^''2'=(F-i), 

and (23) follows from this, (80), and the triangle inequality. 

Finally, wo show (25). From the time- reversed version of (56), and conceding 
various powers of 2*^ and r^/^, we see that 

£:,.(i_^-i/3«),Coq(^. *)<(! + Cc)£;^(i_^i/2N),Co(3(*' i') 

for all r > Combining this with (78) we obtain (25) as desired. This 

completes the proof of Proposition 4.1, and hence of Theorem 1.1. ■ 

17. Null form estimates 

In this section we now apply Theorem 1.1 to obtain some nearly sharp null form 
estimates. The arguments in this section do not require the methods used above 
to prove Theorem 1.1. 

Let Do, D+, D- denote the Fourier multipliers 

Q(c,t) := m + \T\m,T) 

D:^{i,r) :=||?|-|r||0(e,T). 

If is a solution to the free wave equation and s e R, we define ^[0] to be the 
vector {(j){0),DQ^4>t{0)). In particular, we have 

\\m\\H^ = {\\m\\%s + Utm\%.-.y/' 

where := Dq^L"^ is the homogeneous Sobolev space of order s. 
In [7] the following problem was considered: 

Problem 17.1. Determine the set of all exponents {p, /3o, /?+, /3_, ai, 0:2) such that 
one has the estimates 

(85) \\D^o°Di^Dt{m\L.^R'^+^) < wmwH^-. \\m\\H<'. 

for all vector-valued solutions <p, tjj to the free wave equation (not necessarily red or 
blue). 

This problem was resolved in [7] for p = 2, but is largely open otherwise (with 
some partial results in [23], [13]). A successful resolution to this problem (and 
of its generalization to mixed Lebosguo norms LfL"^ and to when (f>, ijj solve the 
inhomogeneous wave equation) is likely to have application to the low-regularity 
behaviour of non-linear wave equations. 

To see the connection between Problem 17.1 and Theorem 1.1, we first observe 
the consequence of Theorem 1.1, which addresses Problem 17.1 in the frequency- 
localized case. 
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Proposition 17.2. Let P be a real number, and k, I > be integers. Let (j) and ip 
be waves which have frequency supports in the sectors 

(86) {(6,6,r,T) : r = ~ 1,^2 ~ 2"', j^l < 2"'} 
and 

(87) 2'={(^i,6,r,T) : r = ~ 1,6 ~ '^-',^'1 < 2"'} 
respectively. Then we have 

(88) 

III □ |^((/.V')||p, III □ |'^(#)||p < 2'3('=-202H^-("-i))2Mi-H^)i5(^)i/2i5(^)i/2 

for all 2 > p > po, e > 0, where \ D\ is the multiplier | □ | := D^D^, the energy 
E{-) is defined as in (1), and the implicit constants may depend on (3, e. 

Proof Consider the Minkowski- confer mal linear transformation L : R""*"^ — > R""*"^ 
given by 

and define the associated operator by 

TLd^:=(j)oL*, 

where L* is the adjoint of L. A routine computation shows that Ti|n| = 2~^-' |n|ri, 
so that 

Tl(\ □ fH) = 2-'^'^| □ f{TL(j>){TLi^). 
Also, we observe from Plancherel that E{Tl4>) ~ 2("-i)Ji;((/)) and E{Tl^I)) ~ 
2("-iW£;(V'). Finally, we have \\TlF\\p = 2-("+i)'/P||F||p. Combining all these 
facts we see that to prove (88) it suffices to do so when I = 0. 
Set Z = 0. We can write 

I □ f{<i»P){x, t) = 2^H^^ J J e2"-(2S+«)e2..t(|c|-2''|r,|)^^^^ v)f{09{2''v) d^dr, 

where f{x) := (j}{x,Q) and g{x) := V'(a;,0), and m is the symbol 
m(6r?) := Ca{Oamm+Mf- 

Since m is smooth and compactly supported, we may decompose m as a Fourier 
series 

for Id, kl ^ 1, where L is some discrete lattice and Cjj' are rapidly decreasing 
co-efficients (uniformly in k). This implies that 

£)^-(,/>^) = ^ Cjj>(l)jipj> 
3,3'eL 



where 



b,{x, t) := 4>{x - J. t), i,^, (x, t) := V^(.x - 2" V- 1)- 



The first claim of (88) then follows from this decomposition, the triangle inequality, 
(3), and the observation that <pj, tpj' have the same energy as </>, tjj respectively. 
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The second claim is proven similarly but also uses the observation that (f)jipj' has 
the same magnitude as (f'ji'j'- ■ 

Wo make the technical remark that the above estimates continue to hold if 
one replaces /3 by the complex number /3 + it, with constants that grow at most 
exponentially in t. 

We now consider the general setting of Problem 17.1, where no frequency re- 
strictions are assumed on <p or ip. 

By considering a large number of key examples, it was shown in [7] that the 
conditions 

n + 1 

(89) /3o +/?++/?-= ai + as + n 

P 

(90) p>po 

(«) "-^^-^ 

(3o > n 

V 

(92) /3o > - (n + 1) 

V 

(93) ai + a2 > ^ 



n + 3 

(94) ai + Q!2 > n 



P 

P 



n — 1 n + 1,1 1, 

(95) a,</3_ + ^ + (n + 2)(i-i) 

were necessary for (85), where i = 1,2. In that paper it was also shown that the 
above conditions were also necessary when p = 2, except for the endpoint cases 
when one has equality in (92), [(91) and (93)], or [(91) and (95) for some i = 1,2], 
with (85) being false in these cases. It was then conjectured that for the above 
conditions are similarly necessary when 2, except perhaps for some endpoints. 

By combining Proposition 17.2 with dyadic decomposition arguments as in [23], 
we can obtain the following progress on this conjecture in the p < 2 case. 

Theorem 17.3. Let Po < p < 2. Then (85) holds whenever (89) holds, (91), 
(92), (95) hold with strict inequality for i = 1,2, and the conditions (93), (94) are 
replaced by the more strict 

1 n + 3,1 1, 

2 n — 1 p 2 

Ifp^PQ, then one can let (92) he obeyed with equality. 

This is sharp except for endpoints when n = 2 (since (96) nearly matches (94) 
in this case), and is somewhat sharp for n > 2. One can replace the multipliers 
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Dq, _D_ by other symbols which satisfy the same types of regularity and decay 

estimates, but wc shall not pursue this matter here. 

Proof In order to apply the complex interpolation method we shall allow the 
indices /3o, P+, P- to acquire an imaginary part it. For notational simplicity we 
keep ourselves to the case t — 0, but the reader may easily verify that the following 
argument also works for arbitrary t with constants which are at most exponential 
in t. 

When p = 2 these claims were proven in [7] (see also [23] ) . Since we have replaced 
(93), (94) with the linear condition (96), we see by complex interpolation that it 
suffices to verify the claims when p = po. In this case (92) becomes /3o > 0; since the 
operator Dq°D^^° is bounded in by standard multiplier theory we may assume 
that Po = 0. 

Let 0, ijj be solutions to the free wave equation. By a finite decomposition and 

time reversal symmetry wc may assume that (f> is supported in the upper light cone 
E+ and that ip is supported on either the upper or the lower light cone T,^. 

Write =: J2j ^j^ =• Sfe V'fc; where ^j, Vfe have frequency supports on the 
region 2-' , D+ 2*^ respectively. We now rewrite (85) as 

\\j2Di--^-\n f-{ci>ji.k)h ^ (E2'"^^'^^('^i))'/'(E2'"^'^(^^))'^'- 

j,k j k 

It suffices to show that 

\\dI+-^-\ □ l'^- (</.,• Vfc) lip < 2--l^-'=l(22«i^£;(.^,.))V2(22"2'=£;(^,))V2 

for all j, k and some e > 0, since the claim then follows from the triangle inequality, 
Cauchy-Schwarz, and Young's inequality for sums. 

By symmetry we may take j < k; by scaling and (89) we may take j = 0. Having 
used (89), our arguments will no longer require this condition and we shall discard 
it. Since we are assuming (95) to hold with strict inequality we may absorb the e 
into the ai factors, and reduce ourselves to showing that 

□ |/3-(0„^,,)||p < 2^'=2«='=i?(,^o)'/'i?(V^fe)'/'. 

Fix k. For any I > 0, decompose the double light cone into finitely overlapping 
projective sectors F of angular width 2~', and let <^o = X^r 'Po,r, V'fe = Z^r V'fc.r be 
a Fourier decomposition of V'fe subordinate to these sectors. It suffices to show 
that 

II Yl i?f"^-|n|''-(<^o,rV'fe,r')llp<2-^'2-'=2"^'=i5(0o)'/'i^(V'fc)'/' 
r,r':Z(r,r')'^2-' 

for alH > 0, since the claim follows by summing in I using the bilinear partition of 

unity based on angular separation (see e.g. [21], [23]). By the triangle inequality 
and Cauchy-Schwarz as before, it suffices to show that 

(97) \\Dl+-^-\a f-iMk,T')\\p < 2--'2-^2'-^^E{M^'^E{i,k,T'f/^ 

for each F, F' with angular separation 2^'. By a spatial rotation wc may assinnc 
that (j) has frequency support in (86), and that either or has frequency support 
in (87). 
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Now suppose k ^ 1. In this case D\^~^~ is equal to 2^ times a harmless 
multipHer on the frequency support of (l)o,ri>k,r' , so by Proposition 17.2 the left- 
hand side of (97) is majorized by 

2ft(/3+-/3-)2/3-(fe-202'(^-(»-i))2'=(?-Hs)£;(^o^j,)i/2£;(^^^^,)i/2_ 

The claim (97) then follows after some algebra from (95), (89), and the assumption 
that (91) holds with strict inequality. 

It remains to consider the case k = 0(1); by a mild Lorentz transformation we 
may make k = 0. (This affects slightly, but this change is irrelevant). Since F 
and r' differ in angle by 2~', some geometry shows that is at least 2'' on the 
frequency support of (jyip- We may thus majorize iniPby l + 2'(^+-'3-), 

and by Proposition 17.2 again the left-hand side of (97) is bounded by 

(1 + 2'('3+-'3-))2^-(-202'('^-("-i))i;(</,o,r)i/'i;(Vo,r')'^'- 

The claim (97) then follows after some algebra from (96), (2), (89), and the as- 
sumption that (91) holds with strict inequality. ■ 

It seems plausible that many of the missing endpoints in the above result could 
be obtained if one was willing to prove a generalization of Proposition 17.2 which 
considered the interaction of multiple scales and multiple angular separations using 
more sophisticated tools than the triangle inequality, and in which the 2^*^ loss was 
eliminated. 

Apart from the issue of endpoints, there is still the unsatisfactory gap between the 
condition (96) in Theorem 17.3, and the necessary conditions (93), (94) conjectured 
in [7] , when n > 2. In particular, to resolve the conjecture we would need to consider 
exponents near the case 

n + 2 

(98) p= =po{n-l), ai+a2 = l/p, 

n 

which is the intersection of (93) and (94). This case turns out to be related to the 
open (and quite difficult) 

Conjecture 17.4 (Machedon-Klainerman for the Schrodinger equation). 

Let (j>, tp be functions on R"'~^^ which have frequency supports in 

and 

{(e,i|?n:Ce-s} 

respectively. Then (4) holds for all p > pQ{n). 

Proposition 17.5. Suppose that (85) holds for some set of exponents satisfying 
(98). Then Conjecture 17A holds with n replaced by n — 1. 

Proof We use the method of descent, exploiting the fact that the paraboloid in 
R" is a conic section of the light cone in R""'"^. 
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To prove Conjecture 17.4 for n— 1, it suffices to verify it for p = pQ{n—l) = 
since the other endpoint p = oo is trivial From Plancherel's theorem it suffices to 
show that 

(99) 

—71 — 1 — ?1 — 1 

for all C°° functions /, 5 on Tlu-i- 

Fix /, g. Let i? ^ 1 be a large number, and consider the functions (pR, ipR 
defined on R"+^ by 



and 



^R{xux',t) := J' J e-2-(*+-i)(«+«)e2"'(*-"^)wiTe2--'-«5(^) d^ds. 
One can easily verify that (j)R and ipR solve the wave equation, and that 

By the assumption that (85) holds at (98), we have 

\\D^,"D''_-Dl+{<t>n\p<R'^mh\\gh 

for some /3o, /?-, P+- However, a computation of the frequency support of 
shows that Dq, are all given by smooth functions comparable to 1 on this 

support, and so we have 

ll</'V'llp<i?'/^ll/l|2||5l|2 

Making the change of variables X = t + xi, T = [t — xi)/2R, we have 

fiOdiv) d^drjdsds' 

and so we have 

\\mLlLl,L^^ < Wfhhh, 

the R^/P factor having cancelled against the Jacobian term. 

The phase factor 2^^^^-^^ can be discarded. Letting R ^ 00 and taking limits, 
and then evaluating the s, s' integrations, we thus see that 

and (99) follows since the X behaviour is trivial. ■ 

The Machedon-Klainerman conjecture for the Schrodinger equation looks very 
similar to the results proved in Theorem 1.1 with k = 0, however the method of 
proof breaks down because the strong Huygens' principle ((40) and (41)) totally 
fails for this equation. The arguments in [25] fail for similar reasons (basically, 
the tubes T are no longer constrained to point in null directions). Only partial 
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progress is known for this problem; for instance, when n = 2 this conjecture should 
hold for all p > 2 — i, but the best result achieved to date is p > 2 — see [22]. 
A resolution of this conjecture would also yield new results for the very difficult 
restriction and Bochner-Riesz problems for the paraboloid [21], [6] as well as the 
problem of pointwise convergence of the Schrodinger equation to the initial data 
[23]. 

The estimate (85) at the cndpoint (98) shares some features in common with 
the conjectures (29), (30) in [25], and the resolutions to these problems may well 
be related. 

18. Appendix I: Proof of Lemma 15.2 

We now give the proof of Lemma 15.2. The main technical difficulty is to obtain a 
good constant in the main term of (63); this will be obtained by using characteristic 
functions to decompose the Fourier domain, followed by an averaging over rotations 
to smooth things out spatially. 

Partition 

where A,^ consists of those points in S*""^ n S which are closer to w than any other 
element of £. Thus A^ is in the 0(l/r)-neighbourhood of w. 

Let G C SO{n) denote the set of all rotations in R" which differ from the 
identity by 0(l/r). Let (Kl be a smooth compactly supported probability measure 
on the interior of G. 

For each fi G G and ui G £, we define the Fourier projection operators Pq^ui for 
test functions / on R" by 

:=Xa(A„)(e/iei)/(0- 

Note that 

(100) 0(0) = ^ Pn,u.m 

uies 

for all n € G and red waves cj) of frequency 1 . 

The decomposition (100) is well-behaved in frequency, but has terrible spatial 
localization properties. To get around this we will now average in dCl and then 
apply a spatial cutoff. 

Recall the function rjo constructed in Section 10. Write 

?7^«(a;) :=rio{ — {x-Xo)) 
for all xq G L. From the Poisson summation formula we have 

1= J2 

We define the functions (j)T at time by 

(101) </'t(x,0) := 77^-^(2;) / {Pn,^^m){x) 



ENDPOINT RESTRICTION FOR THE CONE 



39 



and at other times by 

Mt) ■■= f/(i)0T(O), 

where U {t) was the evolution operator defined in (31). One may veriiy that (pT is a 
red wave with angular dispersion 0(l/r), that the map (j) i-^ <j)T is linear, and that 
(58), (59) hold. 

We now show (60). From (101) and the rapid decay of 77^^ we have the pointwise 
estimate 

(102) </.t(0) < c-^xt(O)^ J \Pn,^^m\ dn, 

so it suffices by Minkowski and the boundedness of Pn,ojT to show that 
||XT(0)Pn,.,<A(0)||2 < C^\\xT{t)Pn,u,^<P{t)h. 

for all Cl G G, since the Cq factor can be absorbed into the factor by the 
hypothesis c < 2"*^°. In fact, we will show the stronger 

(103) \mL^D{xD,tn;r)) ^ C'o ||XD(xD-(t-tD)a'T,t;r) lb 

for all disks D of radius r and time co-ordinate to = 0{R), and any red wave 
(j) with frequency support in the sector {(C,|CI) • 1^1 ~ 1!^(C)<^t) ^ C/r}. The 
previous claim follows by breaking up Xt(0) into various disks D. 
We now show (103). We have the identity 

</.(x,iz5) = j e2-(-«+(*-*°)l«l)(^(?)^)(C) di 

where </? is an arbitrary bump function which equals one on the sector € S : 
Z(^, wt) ^ 1/?"} and is adapted to a slight enlargement of this sector. By standard 
stationary phase estimates and the observation that to — t = 0{CQr'^), we thus 
have 

cpito) = m * K 

where the kernel K satisfies the estimates 

\K{x)\ < Co^ri-"(l + \x-{t- tz5)^T|/r)-^"'(l + \{x - {t ^ tD)LOT) ■ c^tD"^". 

The claim (103) then follows from a dyadic decomposition of K around the point 
{t — tD)oJT, followed by Young's inequality. This proves (60). 

We now show (61). Write R' := dist(T,Q). Let D denote the disk D = 
D{0,0;C^^R'). Since R' » R, it suffices from (32), (33), and crude estimates 
to show that 

UT\\LHD)<R'~''E{cl>y/^. 

But this is clear from (101) and the estimates 

Wv'^^Wloo^d) < R''"", ||Pa,a,.</'(0)||2 < E{cl>y/\ 

This proves (61). 

We now show (62). From (103) the left-hand side of (62) is majorized by 

r^supxT{xq,tg)-^\\x%^ +^ o.^^cI)t\\1 
T 1 
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From (102) this is majorized by 

c-^r^ sup XT(x„t,)-3|lx?j(,,+t,,,,o;.)XT(0)^ / \Pn,u:^m\ dnWl 
Prom (57), (6) we have the elementary inequahty 

XT{Xq, tq) 

SO we may bound the previous by 

Summing in xt, we reduce to showing that 

Ell / \Pa,.m\dn\\l<E{ct>). 



But this foUows from Minkowski's inequahty followed by Plancherel's theorem. 
We now show (63). We expand the left-hand side as 

(EII/E E "^«o,T(a.,.o)^"°^n,.(0(O)) dn\\lY'\ 
go ojeS xoEL 

By Minkowski's inequality this is less than or equal to 

(104) / (EllEE"^«o,TK.c)^""^n,.(</'(0))||^)i/^ dn. 



go w xo 
jn— 1 



Define the set F C 5"-^ by 

Y := \J {a e : dist(a, 5""^^^) > Cc'^/r}; 



this set Y plays a similar role here to the sets /"■''(Q) used in Proposition 4.1. and 
define Pn(Y) to be the multiplier 

:=Xn(r)(||)/(0- 

By the triangle inequality, (104) is less than the sum of 

(105) / (EllEE"^'?c.T(a.,xo)r/^°^n.a.Po(F)</'(0)||^)^/2 d^. 



90 <^ a:o 



and 



(106) /(E II E E '"go,T(..,.o)^"°^n,.(l - PniY))m\\iy^^ dn. 



go ll> xo 



Consider the quantity (105). The contributions of each u are orthogonal as ui varies, 
so we can rewrite (105) as 

/ (EEllE"^'.o,T(.,xo)^"°^a,a,Pn(r)<^(0)||i)^/^ dQ, 



go w Xo 
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which can be re-arranged as 



This is clearly less than or equal to 



^ u ^ go xo 

Summing in qo and then in xo, this simplifies to 

/(E/ l(^n,a,Pn(r)<^(0))(a;)p dx^/^ dn. 

By the orthogonality of the Pn,oj , we thus have 

(105) < J \\PniY)mh dn < E{cl>f'^. 

Now consider (106). Repeating the above argument, but noting that we must 
use almost orthogonality instead of orthogonality, we obtain 



(106) < j ||(i-Po(y))</.(0)||2 dn. 

By Cauchy-Schwarz we thus have 

(106) <(| \\{l-Pn(Y))m\\ldnY'^; 

by Plancherel we thus have 

(106) < ( j j Xa(r)c(||) dn) d^" ■ 

The inner integral can be seen to be O(c^) for all ^, so by Plancherel again we have 

(106)<cE(^)V2. 

Combining our estimates for (105) and (106) we obtain (63). ■ 

19. Appendix II: Proof of Lemma 11.1 

To complete the proof of Theorem 1.1 wc have to prove Lemma 11.1. This shall 
be a straightforward application of the wave packet decomposition Lemma 15.2 and 
the energy estimates on cones developed in Section 13. However, our arguments 
arc not as delicate as those in the rest of the proof of Theorem 1.1, as Lemma 11.1 
is not an endpoint estimate. For instance, we will be able to lose powers of E}/^ 
in our estimates. 

Wc turn to the details. Let R > 2'^"'^, and let (j), ijj be red and blue waves of 
frequency 1, 2*^ respectively and margins at least 1/200. We may assume the energy 
normalization (11). By translation invariance it suffices to show that 

||(PD(/')V'l|LP(g"""(0,0;K,2i{)), ||'/>(-PDf/')l|LP(Q"""(0,0;ii,2K)) ^ R~^^'-' 
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for any disk D centered at the origin with radius 2'^ < r < Coi?^^^"*"*/^. We shall 
only prove the estimate for {PD(j))ip; the estimate for (j){P£>ip) is proven similarly, 
observing that any additional powers of 2^^ which appear can be absorbed in to the 
R-^/^ factor. 

As in previous arguments, we obtain the estimate by interpolation between 
an estimate and an estimate, namely 

(107) \\{PDm\L^iQ'^--mR,2R)),< R'"''R"' 

and 

(108) ||(Pl5<^)V'IU^(Q<"".(0,0;fl,2fl)), < 

By interpolation and some algebra involving (8) one obtains the desired resiilt if 
N is sufficiently small. The point is that (107) improves over (27) by a substantial 
power of R. 

We first prove (107). The contribution outside of the set C"''=''(0, 0; i^^/^) jg 
acceptable by (40), and the contribution inside this set is acceptable by Corollary 
13.2. This proves (107). 

Now we prove (108). Set Q := Q{0,0;2R). We use Lemma 15.2 with this cube 
Q and some arbitrary value of c (say c = 2~'-^o) to decompose {Pd4') a^s 

(PZ50) = iPD4>)T. 

By (61) and crude estimates (e.g. (7) or Lemma 14.2) the contribution of those 
tubes T for which dist(T, 0) » i? is acceptable, so we can restrict ourselves to those 
tubes T for which dist(T, 0) < R. There are only 0(i?^) of these tubes. 

We now dispose of those remaining tubes T for which i?i/2+i/A' < dist(T, 0) < R. 
For such tubes T we see from (60) with t = and (34) that 

E{{PD<t>)T) < I1xt(0)Pd</'(0)|1^ < i?-^i?(0) = i?-^, 

and one can dispose of the contribution of these tubes by crude estimates. 

It remains to deal with those tubes for which dist(T, 0) < In this case 

we want to exploit the fact that (Pd(/))t is concentrated on R^/^T, the dilate of T 
around its center by R^^^ . Indeed, from (62) we see that 

\\{PdMlhq\r^/-t)<R-'', 

and so the contribution outside R^I^T is acceptable by crude estimates. It thus 
remains to control the expression 

II y^(-PD'/')TV'Xfli/'VTll-L^(Q<"'"(0,0;fl,2fl))- 
T 

We now observe the geometric fact that as T ranges over all tubes in T with 

dist(T,0) < the tubes Xb^/^'t 

have an overlap of at most 0{R'^^^) in 
(5"""(0, 0; i?, 2i?)). By almost orthogonality, it thus suffices to show that 

(^||(Pi.<^)TVII^)^/^<i?-("-^)/^ 
T 
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Applying Lemma 14.2, we can bound the left-hand side by 

T 

But this is acceptable by (55) and (38). ■ 
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